Overview

Dataset Statistics

Number of Variables 9
Number of Rows 17952
Missing Cells 20295
Missing Cells (%) 12.6%
Duplicate Rows 22
Duplicate Rows (%) 0.1%
Total Size in Memory 11.1 MB
Average Row Size in Memory 647.4 B
Variable Types
  • Categorical: 9
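The percentage figures above follow from the raw counts. Below is a minimal sketch of that arithmetic. It assumes the missing-cell percentage is taken over all cells (rows × variables), the duplicate percentage over rows, and both are rounded to one decimal place. The helper name `pct` is hypothetical.

```python
def pct(part: int, whole: int) -> float:
    """Return part/whole as a percentage rounded to one decimal place."""
    return round(100 * part / whole, 1)

# Counts copied from the Dataset Statistics block.
n_rows, n_variables = 17952, 9
missing_cells, duplicate_rows = 20295, 22

missing_cells_pct = pct(missing_cells, n_rows * n_variables)
duplicate_rows_pct = pct(duplicate_rows, n_rows)
print(missing_cells_pct, duplicate_rows_pct)
```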

Dataset Insights

Postal Code has 6409 (35.7%) missing values (Missing)
Property Size Description has 13886 (77.35%) missing values (Missing)
Date of Sale (dd/mm/yyyy) has a high cardinality: 303 distinct values (High Cardinality)
Address has a high cardinality: 17796 distinct values (High Cardinality)
Price (€) has a high cardinality: 3405 distinct values (High Cardinality)
County has constant value "Dublin" (Constant)
Date of Sale (dd/mm/yyyy) has constant length 10 (Constant Length)
County has constant length 6 (Constant Length)
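Labels like the ones above could be derived per column roughly as follows. This is an illustrative sketch, not the profiling tool's actual logic. The function name `column_flags`, the default threshold of 50 distinct values, and the choice to flag any missing value at all are assumptions.

```python
from typing import List, Optional, Sequence

def column_flags(values: Sequence[Optional[str]],
                 high_cardinality: int = 50) -> List[str]:
    """Return report-style labels for one column (None marks a missing cell).

    The threshold and the exact rules are assumptions for illustration.
    """
    present = [v for v in values if v is not None]
    flags: List[str] = []
    if len(present) < len(values):
        flags.append("Missing")
    distinct = set(present)
    if len(distinct) == 1:
        flags.append("Constant")
    elif len(distinct) > high_cardinality:
        flags.append("High Cardinality")
    # Constant length: every non-missing string has the same character count.
    if present and len({len(v) for v in present}) == 1:
        flags.append("Constant Length")
    return flags

print(column_flags(["Dublin", "Dublin", "Dublin"]))
```

A column such as County, where every value is "Dublin", picks up both the Constant and Constant Length labels under these rules, which matches the two County insights listed above.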

Variables


Date of Sale (dd/mm/yyyy)

categorical

Approximate Distinct Count 303
Approximate Unique (%) 1.7%
Missing 0
Missing (%) 0.0%
Memory Size 1.3 MB

Length

Mean 10
Standard Deviation 0
Median 10
Minimum 10
Maximum 10

Sample

1st row 01/01/2017
2nd row 01/01/2017
3rd row 01/01/2017
4th row 01/01/2017
5th row 01/01/2017

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 143616
  • Date of Sale (dd/mm/yyyy) has words of constant length
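Because every value has length 10 and the column name states the dd/mm/yyyy layout, the strings can likely be parsed into real dates before any time-based analysis. Here is a minimal sketch using the standard library. The function name `parse_sale_date` is hypothetical, and it assumes all values follow the layout of the sampled rows.

```python
from datetime import date, datetime

def parse_sale_date(text: str) -> date:
    """Parse a dd/mm/yyyy string such as '01/01/2017' into a date object."""
    # strptime raises ValueError if the text does not match the format.
    return datetime.strptime(text, "%d/%m/%Y").date()

first_sale = parse_sale_date("01/01/2017")
print(first_sale.isoformat())
```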

Address

categorical

Approximate Distinct Count 17796
Approximate Unique (%) 99.1%
Missing 0
Missing (%) 0.0%
Memory Size 1.8 MB

Length

Mean 40.4322
Standard Deviation 7.3622
Median 39
Minimum 19
Maximum 84

Sample

1st row 1 PINTAIL HOUSE, R...
2nd row 11 PINTAIL HOUSE, ...
3rd row 124 SEAFIELD RD, C...
4th row 126 SEAFIELD RD, C...
5th row 128 SEAFIELD RD, C...

Letter

Count 543885
Lowercase Letter 97410
Space Separator 97824
Uppercase Letter 446475
Dash Punctuation 219
Decimal Number 48051
  • Address contains many words: 5105 words
  • The most frequent word (dublin) occurs over 3.7 times as often as the second most frequent word (rd)

Postal Code

categorical

Approximate Distinct Count 23
Approximate Unique (%) 0.2%
Missing 6409
Missing (%) 35.7%
Memory Size 829.8 KB
  • The most frequent value (Dublin 15) occurs over 1.61 times as often as the second most frequent value (Dublin 24)

Length

Mean 8.6058
Standard Deviation 0.5003
Median 9
Minimum 8
Maximum 20

Sample

1st row Dublin 3
2nd row Dublin 3
3rd row Dublin 3
4th row Dublin 3
5th row Dublin 3

Letter

Count 69284
Lowercase Letter 57740
Space Separator 11545
Uppercase Letter 11544
Dash Punctuation 0
Decimal Number 18507
  • The most frequent word (dublin) occurs over 7.5 times as often as the second most frequent word (15)

County

categorical

Approximate Distinct Count 1
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 1.2 MB

Length

Mean 6
Standard Deviation 0
Median 6
Minimum 6
Maximum 6

Sample

1st row Dublin
2nd row Dublin
3rd row Dublin
4th row Dublin
5th row Dublin

Letter

Count 107712
Lowercase Letter 89760
Space Separator 0
Uppercase Letter 17952
Dash Punctuation 0
Decimal Number 0
  • County has words of constant length

Price (€)

categorical

Approximate Distinct Count 3405
Approximate Unique (%) 19.0%
Missing 0
Missing (%) 0.0%
Memory Size 1.9 MB

Length

Mean 11.0565
Standard Deviation 0.4248
Median 11
Minimum 9
Maximum 14

Sample

1st row €242,424.00
2nd row €242,424.00
3rd row €535,500.00
4th row €630,000.00
5th row €535,500.00

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 143921
  • Price (€) contains many words: 3405 words
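Price (€) is profiled as categorical because the values are formatted strings such as €242,424.00. A hedged sketch of converting them to numbers follows. It assumes every value uses a leading € sign, comma thousands separators, and a dot decimal point, as in the sampled rows. The function name `parse_price` is hypothetical.

```python
from decimal import Decimal

def parse_price(text: str) -> Decimal:
    """Convert a string like '€242,424.00' to an exact Decimal amount."""
    # Drop surrounding whitespace, the currency sign, and thousands separators.
    cleaned = text.strip().lstrip("€").replace(",", "")
    return Decimal(cleaned)

sample_prices = [parse_price(p) for p in ("€242,424.00", "€535,500.00", "€630,000.00")]
print(sample_prices)
```

`Decimal` is used here rather than `float` so that currency amounts keep their exact two-decimal representation.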

Not Full Market Price

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 1.1 MB
  • The most frequent value (No) occurs over 20.55 times as often as the second most frequent value (Yes)

Length

Mean 2.0464
Standard Deviation 0.2104
Median 2
Minimum 2
Maximum 3

Sample

1st row Yes
2nd row Yes
3rd row Yes
4th row No
5th row Yes

Letter

Count 36737
Lowercase Letter 18785
Space Separator 0
Uppercase Letter 17952
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (No, Yes) take over 50.0%
  • The most frequent word (no) occurs over 20.55 times as often as the second most frequent word (yes)

VAT Exclusive

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 1.2 MB
  • The most frequent value (No) occurs over 3.47 times as often as the second most frequent value (Yes)

Length

Mean 2.2237
Standard Deviation 0.4167
Median 2
Minimum 2
Maximum 3

Sample

1st row No
2nd row No
3rd row No
4th row No
5th row No

Letter

Count 39919
Lowercase Letter 21967
Space Separator 0
Uppercase Letter 17952
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (No, Yes) take over 50.0%
  • The most frequent word (no) occurs over 3.47 times as often as the second most frequent word (yes)
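The report does not list the Yes/No counts directly, but they can be recovered from the letter counts. Here is a small worked sketch. It assumes each column contains only the strings "Yes" and "No" with no missing cells, as the statistics for both flag columns indicate. The function name `yes_no_counts` is hypothetical.

```python
from typing import Tuple

def yes_no_counts(total_letters: int, n_rows: int) -> Tuple[int, int]:
    """Recover (no_count, yes_count) from the total letter count of a Yes/No column."""
    # Every row contributes at least 2 letters ("No"); each "Yes" adds one more.
    yes = total_letters - 2 * n_rows
    no = n_rows - yes
    return no, yes

# Letter counts and row count copied from the report.
not_full_market_price = yes_no_counts(36737, 17952)
vat_exclusive = yes_no_counts(39919, 17952)
print(not_full_market_price, vat_exclusive)
```

Under these assumptions, Not Full Market Price works out to 17119 No and 833 Yes, and VAT Exclusive to 13937 No and 4015 Yes. The resulting No-to-Yes ratios, about 20.55 and 3.47, agree with the frequency ratios reported for the two columns.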

Description of Property

categorical

Approximate Distinct Count 3
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 1.7 MB
  • The most frequent value (Second-Hand Dwelling house /Apartment) occurs over 3.41 times as often as the second most frequent value (New Dwelling house /Apartment)

Length

Mean 35.1877
Standard Deviation 3.3488
Median 37
Minimum 29
Maximum 37

Sample

1st row Second-Hand Dwelli...
2nd row Second-Hand Dwelli...
3rd row Second-Hand Dwelli...
4th row Second-Hand Dwelli...
5th row Second-Hand Dwelli...

Letter

Count 545993
Lowercase Letter 478252
Space Separator 53855
Uppercase Letter 67741
Dash Punctuation 13885
Decimal Number 0
  • The top 2 categories (Second-Hand Dwelling house /Apartment, New Dwelling house /Apartment) take over 50.0%

Property Size Description

categorical

Approximate Distinct Count 3
Approximate Unique (%) 0.1%
Missing 13886
Missing (%) 77.3%
Memory Size 486.7 KB
  • The most frequent value (greater than or equal to 38 sq metres and less than 125 sq metres) occurs over 3.79 times as often as the second most frequent value (greater than or equal to 125 sq metres)

Length

Mean 57.5829
Standard Deviation 13.3461
Median 65
Minimum 22
Maximum 65

Sample

1st row greater than or eq...
2nd row greater than or eq...
3rd row greater than or eq...
4th row greater than or eq...
5th row greater than or eq...

Letter

Count 169736
Lowercase Letter 169736
Space Separator 46264
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 18132

Interactions

Missing Values